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( Abstract 

We give a randomized algorithm that determines if a given graph has a simple path of length 
at least k in 0(2 k ■ poly(n, fc)) time. 

0\ '. 1 Introduction 

The fc-path problem is to determine if a given graph contains a simple path of length at least 
CD ■ k, and if so, produce such a path. When k is given as a part of the input, the problem is well- 

c/2 ! known to be NP-complete. The general problem has many practical applications (cf. O [15] ) . The 

trivial algorithm enumerating all possible fc-paths in an n-node graph uses Q(n k ) time, so it is 
only polynomial for k = 0(1). The first algorithm to reduce the runtime dependency on k was 
given by Monien [13] whose algorithm runs in 0*(k\) time (the O* notation suppresses poly(n,A;) 



> 



\ factors). Hence the case k < (log n)j (log log n) is still polynomial time solvable. For years it 

was not known if the 0(logn)-path problem was in polynomial time; a breakthrough by Alon, 
Yuster, and Zwick [2] finally led to such an algorithm. They gave a randomized algorithm running 
\ in 0*((2e) k ) < 0*(5A4 k ) time, and a deterministic 0*{c k ) time algorithm, where c is a large 

constant. Since it has been known for many years prior that when k = n the problem is solvable in 
0*(2 k ) time d [9] , it is natural to ask if there is an algorithm that can match this runtime for 
all values of k. 

It has been only recently that faster A:-path algorithms have appeared in the literature. In 2006, 
two groups independently discovered <3*(4 fc ) randomized algorithms and 0*(c k ) deterministic algo- 
rithms, with c = 16 [10] and c = 12.5 [6j. Very recently, Koutis [UJ presented a novel randomized 
algorithm for A;-path that runs in 0*(2 3fc / 2 ) < 0*(2.83 fc ) time. In this note, we extend his result 
to obtain an 0*{2 k ) time algorithm. Koutis shows how to detect if a graph has a A:-subgraph with 
an odd number of /c-paths in 0*{2 k ) time. By augmenting his approach with more random choices 
and some additional ideas, we can find a fe-path in roughly the same running time. As mentioned 
above, the best known algorithms for finding a Hamilton path in an n-node graph run in 0*(2 n ) 
time and are quite old. Therefore any significant improvement in the runtime dependence on k 
given by our algorithm would imply a faster Hamilton path algorithm, which has been an open 
problem for over forty years. We do not wish to insist that our algorithm is optimal, but rather 
that further progress would entail a substantial breakthrough in algorithms for NP-hard problems. 
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2 Preliminaries 



Most of our notation is standard, however we do require some notions from algebra that are not often 
used in graph algorithms. Let F be a field and G be a multiplicative group (its binary operation 
is written as a multiplication). The group algebra F[G] is an algebraic object that incorporates 
properties of both objects, defined as follows. Elements of F[G] have the form 

Y a a9' 

geG 

where each a g G F. That is, the elements are formal sums over the group elements, with coefficients 
from the field. Addition in F[G] is defined in a point- wise manner: 

E a 9 9 + \J2 b 99] = ^(°9 + b g )g. 
V<?eG / \ g eG J g eG 

Multiplication has the form of a convolution: 



K geG J \geG J geG \heG J 
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Note the above definition coincides with the one in [TT]. The above operations define a ring with 
and 1, where G F[G] is the element X^gG a g9 sucn that au a g are equal to G F, and 1 G F[G] 
is the multiplicative identity 1 G G of the group. 

In our algorithm, we work over the group algebra GF (2 e )[7j2], for particular integers k,£ > 0. 
Here is the group of binary /c-vectors, endowed with componentwise addition modulo 2 as its 
operation. GF(2r) is the unique field on 2 l elements. We use Wo to denote the all-zeros vector (the 
identity) of Z*. Note that every v G Z2 is its own inverse: = Wq. Every element in the algebra 
has the form Yl,v& k a v v > where a v G GF(2 e ). 



Example The elements of GF{2 2 ) can be represented as the four polynomials 0, 1, x, 1 + x over 
GF(2), where computations are done modulo x 2 + x + 1. For example, x 3 = x ■ x 2 = x ■ (1 + x) = 
x + x 2 = 1 in GF{2 2 ). Over GF(2 2 )[Zl], 
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and over F[Z|] in general, 
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3 Algorithm for the £>Path Problem 



Fix an underlying graph G in the following, with vertex set {1, . . . , re}. Let F be a field, let A be 
the adjacency matrix of G, and let x\, ... ,x n be variables. Define a matrix B[i,j] = A[i,j]x{. Let 
1 be the row n- vector of all l's, and x be the column vector defined by x[i] = X{. Define the A;- walk 
polynomial to be Pk(xi, . . . , Xk) = 1 • B k ~ 1 ■ x. 

Proposition 1 



Clearly, there is a A;-path in G iff Pk{x\, . . . ,x n ) contains a multilinear term. We give a ran- 
domized algorithm R with the property that: 

• if Pk has a multilinear term, then Vr\R outputs yes] > 1/5, 

• if Pk does not have a multilinear term, then R always outputs no. 

In fact, the statement we can prove is more general. 

Theorem 3.1 Let P(x\, . . . , x n ) be a polynomial of degree at most k, represented by an arithmetic 
circuit of size s(n) with + gates (of unbounded fan-in), x gates (of fan-in two), and no scalar 
multiplications. There is a randomized algorithm that on every P runs in O* (2 fc s(n)) time, outputs 
yes with high probability if there is a multilinear term in the sum-product expansion of P, and 
always outputs no if there is no multilinear term. 

Remark 1 We may assume without loss of generality that every multilinear term of P has degree 
at least k, and at least one multilinear term has degree exactly k. If not, let k' < k be the minimum 
degree of a multilinear term in P. We can try all j = 1, . . . ,k and multiply the final output of the 
circuit for P by j new variables x n+ \, . . . ,x n +j, obtaining a polynomial P 3 , which we feed to the 
randomized algorithm. Note that when j = k — k! , our assumption holds. 

By observing that Pk can be implemented with a circuit of size 0(k(m + n)) where m is the 
number of edges in G, the fc-path algorithm is obtained. We begin the proof with a description of 
the algorithm. The basic idea is to substitute random group algebra elements for the variables such 
that all non- multilinear terms in P evaluate to zero and some multilinear terms survive. Then we 
augment the scalar-free multiplication circuit with random scalar multiplications over a field large 
enough that the remaining multilinear polynomial evaluates to nonzero with decent probability. 



Algorithm Pick n uniform random vectors V\,...,v n from Z(>. For each multiplication gate gi 
in the circuit for P, pick a uniform random W{ € F \ {0}. Insert a new gate that multiplies the 
output of gi with Wi , and feeds the output to those gates that read the output of gi . Let P' be the 
new polynomial represented by this arithmetic circuity Output yes iff P'(Wq + . . . , Wo + v n ) ^ 0. 

In the evaluation of the fc-path polynomial Pk, the algorithm corresponds to picking random yi.j.c in F for 
c = 1, . . . , k — 1, i,j = 1, . . . , n, letting B c [i,j] = Vi,j,cB[i,j], then evaluating Pf.(xi, . . . , x n ) = 1 • Bfc-i • • • B\ ■ x on 
the appropriate vectors. 





is a walk in G 



Set F = GF(2 3+1 °s fc ). 
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Runtime Let us describe one way to implement the algorithm efficiently. The only non-trivial 
step is the final polynomial evaluation. By definition, the evaluation of P'{Wq + v\, . . . , Wo + v n ) 
takes 0(s(n)) arithmetic operations. However, since evaluation takes place over FfZ*], we need 
to account for the cost of arithmetic in the group algebra. Elements in -F[Zr>] can be naturally 
interpreted as vectors in F 2k . Addition of these vectors (as elements in i^[Z§]) can be done in 
0(2 k log \F\) time with a component- wise sum. Multiplication of vectors u and v over the group 
algebra can be done in 0(k2 k log 2 \F\) time by a Fast Fourier Transform style algorithm, as we now 
describe. 

For simplicity, let £ = 3 + log 2 k. Represent elements of F = GF(Qr) as univariate polynomials 
over GF{2) of degree at most I in the usual way, so the entries of u and v are degree-^ polynomials. 
Over the ring C[x], multiply u and v with the matrix Hk for the discrete Fourier transform on 
Z 2 (also called the Walsh-Hadamard transform) in 0(k2 k M(£)) time (cf. [12]) where M(d) is the 
runtime for computing the product of two degree-cf univariate polynomials over GF{2). Since i is 
small, it suffices to use the bound M(£) < 0{£ 2 ). Take the pointwise product of the two resulting 
vectors obtaining a vector w, and multiply w with (note H^ 1 = Hf., so this is the inverse of 
the transform). The resulting vector x contains 2 k polynomials of degree at most 2£. Reduce each 
polynomial modulo an irreducible degree-^ polynomial over GF(2), in 0(2 k M(£)) time. (For a 
discussion of how to obtain irreducible polynomials, cf. |16|.) This has the effect of mapping our 
results in C[x] back down to GF(2 t ). Overall, the evaluation of P' takes at most O* (2 k s(n)) time. 

We note that while the above computation naively needs Q(2 k ) space, detecting if P' evaluates to 
zero can be done in 0(poly(n, k)) space using (for example) the representation-theoretic technique 
of Koutis [11] . For the sake of brevity, we will not concern ourselves with this issue. 

Correctness The crucial observation of Koutis [11] is that, for any V{ E Z*, 

(W + Vif = Wq 2 + 2vi + vf = Wo + + Wo = mod 2. 

Therefore all squares in P vanish in P'(Wq + v\,. . . ,Wo + v n ), since F has characteristic 2. It 
follows that if P(x\, . . . , x n ) does not have a multilinear term, then P'(Wq + v\, . . . , Wo + v n ) = 
over .FfZjj], regardless of the choice of V{. 

In the remaining paragraphs, we prove that if the sum-product expansion of P(x\, . . . , x n ) has 
a multilinear term, then P'(Wo + v%, . . . , Wo + v n ) ^ with probability at least 1/5, over the 
random choices of w^s and Vi's. By Remark [TJ we may assume that every multilinear term in the 
sum-product expansion of P has the form c ■ x^ ■ ■ ■ Xj , where k' > k and c G Z. For each such 
term, there is a corresponding collection of multilinear terms in P', each of the form 

k' 

Wi ■ ■■Wk'-l Y[( W + Vij), 

where the sequence w\, . . . ,Wk'-i is distinct for every term in the collection (as the sequences of 
multiplication gates gi, ■ ■ ■ ,gk'-i are distinct). Note these terms do not have leading coefficients, 
since there are no scalar multiplications in the arithmetic circuit. 

Consider a monomial n^if^o + v j) i n the polynomial (disregarding the Wi's for the moment). 
Koutis [TT] proved that if the i vectors vi,...,Vi are linearly dependent, this monomial vanishes 
modulo 2. We observe that his proof works over any field of characteristic two. 
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Proposition 2 (Koutis) If vi, . . . ,Vi € Z| are linearly dependent over GF{2), then n}=i(^o + 
Vj ) = in F[Z%]. 

Proof. If vx, ■ ■ ■ , V{ are linearly dependent, there is a nonempty subset T of the vectors that sum 
to the all- zeros vector. In FfZg], this is equivalent to 

Hvj=w . 

Let S C T be arbitrary. Multiplying both sides by n.je(SAT) 

n^= n v r 

jes je(SAT) 

Therefore IXj-£r(Wo + v j) = Sscr flljes^i) = *-* m °d 2, since each product appears twice in the 
sum. Hence n}=i(^ / o + Vj) = over FfZ^], since F is characteristic 2. □ 
Therefore linearly dependent vectors lead to a cancellation of terms. On the other hand, when 
vi, . . . ,Vi are linearly independent, n}=i(^ / o + Vj) is just the sum over all vectors in the span of 
vi, . . . ,Vi, since each vector in the span is of the form Y\j£S v j f° r some SC. [i], and there is a 
unique way to generate each vector in the span. 

Proposition 3 7/ v\, . . . ,Vk € Zg are linearly independent over GF{2), then IJ* = i(Wo + Vj) = 

By Propositions [2] and [3l and the fact that any k' > k vectors are linearly dependent, P'(Wq + 
Wo + v n ) evaluates to either 0, or c Yliv&z k v ^ or some c £ F. The final piece of our argument 
shows that if P has a multilinear term, then c ^ with probability at least 1/5. 

The vectors V(_ x , . . . , V£ k chosen for the variables in a multilinear term of P are linearly indepen- 
dent with probability at least 1/4, because the probability that a random k x k matrix over GF{2) 
has full rank is at least 0.28 > 1/4 (cf. [5], Lemma 6.3.1). Hence in P'(Wo + v±, . . . , Wo + v n ), there 
is at least one multilinear term in P corresponding to a set of k linearly independent vectors, with 
probability at least 1/4. 

Let S be the set of those multilinear terms in P which correspond to k linearly independent 
vectors in P'(Wo + v±,...,Wo + v n ). Then the above coefficient c = Yli c i f° r some Cj G F 
corresponding to the ith multilinear term in S. Conditioned on S ^ 0, we claim that Y2i °i = 
with probability at most 1/2 3 . Each coefficient q comes from a sum of products of k — 1 elements 
Wi t x, • • • , Wi,k-i corresponding to some multiplication gates g^i, . . . , g^k-i i n the circuit. (In the k- 
path case, note that each q is a sum of products of the form yi lj i 2 ,xyi2,i3, 2 ' ' ' Vik-iAkik-l-) Construing 
the Wi's as variables, the sum Q(wx, ■ ■ ■ , w s (n)) = J2i c i is a degree-k polynomial over F in the 
variables. Assuming S ^ 0, Q is not identically zero. (Note each monomial in Q has coefficient 
1.) By the Schwartz-Zippel Lemma |14j . the algorithm's random assignment to the variables of 
Q results in an evaluation of G F with probability at most k/\F\ = 1/2 3 . Since S ^ with 
probability at least 1/4, the overall probability of success is at least 1/4 • (1 — 1/2 3 ) > 1/5. 
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Constructing a Path The algorithm R merely detects if a graph has a /c-path. We note that 
an 0*{2 k ) algorithm producing a /c-path (when it exists) is easy to obtain; let us briefly outline one 
possible algorithm of this kind. For an arbitrary node we remove Vi from the graph and run the 
/c-path detection algorithm for O(logn) trials, using new random bits for each trial. If the algorithm 
outputs yes in some trial, we recursively call our algorithm on the graph with v i removed, returning 
the /c-path that it returns. Otherwise, we add v i back to the graph and move to the next candidate 
node Uj.fi, noting that such a move occurs at most k times (with high probability). Hence we can 
bound the runtime with the recurrence 

T(n) < O* (2 k ■ klogn) + T(n — 1), 

which is 0*(2 k ). The overall probability of error can be bounded by a constant less than 1, since 
the probability that all O(logn) trials result in error is inversely polynomial in n. 

4 Conclusion 

We end with two interesting open questions. We conjecture that both can be answered affirmatively 

• Let G be a graph with arbitrary costs on its edges. The Short Cheap Tour problem is to 
find a path of length at least k where the total sum of costs on the edges is minimized. This 
problem is fixed-parameter tractable, in fact: 

Theorem 4.1 Short Cheap Tour can be solved in 0*(4 k ) time by a randomized algorithm 
that succeeds with high probability. 

We omit the proof here; our algorithm is a simple extension of the divide-and-color approach 
for solving /c-path [ID] . Can Short Cheap Tour be solved in 0*(2 k ) time? The algorithm 
of this paper does not appear to extend to weighted graphs. 

• Is there a deterministic algorithm for /c-path with the same runtime complexity as our al- 
gorithm? This question was also raised by Koutis [IT], however our algorithm looks more 
difficult to derandomize. Our argument relies on the fact that polynomial identity testing is 
in RP, and it is known that a polytime derandomization of this would imply strong circuit 
lower bounds [8]. 
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